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Abstract 

We provide an empirical investigation aimed at uncovering the statistical properties of intricate stock trading networks 
based on the order flow data of a highly liquid stock (Shenzhen Development Bank) listed on Shenzhen Stock Ex- 
change during the whole year of 2003. By reconstructing the limit order book, we can extract detailed information 
of each executed order for each trading day and demonstrate that the trade size distributions for different trading days 
exhibit power-law tails and that most of the estimated power-law exponents are well within the Levy stable regime. 
Based on the records of order matching among investors, we can construct a stock trading network for each trading 
day, in which the investors are mapped into nodes and each transaction is translated as a direct edge from the seller to 
the buyer with the trade size as its weight. We find that all the trading networks comprise a giant component and have 
power-law degree distributions and disassortative architectures. In particular, the degrees are correlated with order 
sizes by a power-law function. By regarding the size executed order as its fitness, the fitness model can reproduce the 
empirical power-law degree distribution. 
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1. Introduction 

Most people believe that the global economic and financial systems exhibit more and more remarkably intertwined 
nature with a continuing increase in economy globalization using different measures Hl-Ql, although some measures 
give a different scenario which provides a diffusion path for the US subprime mortgage lending crisis triggering 
the current global financial and economic crisis, mainly in the developed economies ||7[]. There is no doubt that 
studying the structure and dynamics of financial and economic networks will provide revolutionary insights to our 
understanding of the evolution of financial and economic systems, which have predictive implication for policy makers 
1^ . The ideal picture would be that we have a unified network for all economic units and the detailed information 
of their interactions. Unfortunately, such a database is almost impossible to build. Nevertheless, the importance of 
network analysis of financial and economic systems is self-evident, especially in the Econophysics community. 

In recent years, complex network theory has witnessed a flourishing progress in many interdisciplinary natural 



and social fields |I0l - iI5ll . Quite a few network patterns differing from random graphs have been revealed, such as 



small world Ill0lll6ll . scale-free degree distribution III III , community ||I7| - |20|1 . and rich club Il21l - i23[l . to list a few. 
Complex financial and economic networks can be classified in three categories. In the first category of networks, the 
nodes present financial or economic agents (economies, companies, financial institutions, traders, et al) and a link is 
drawn between two nodes if they have certain interactions (such as investment, trade, lending, economic cooperation. 



et al) Il24l - l35[l . In the second category of networks, the nodes are agents, each of which has a time series recording 
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its behavior and a link is drawn due to the correlation between two time series |3^42]. The third category contains 
networks converted from individual financial time series l43l - l47ll . based on diff'erent mapping methods such as phase 
space embedding, visibility algorithm, sub-series correlation, recurrence, and n-tuple occurrence 1481 - 15 51] . 

The complex financial and economic networks of first type range from the macroscopic level of countries to the 
microscopic level of traders in markets. Trading networks at the microscopic are less studied, since the detailed 
information of trader identities and their transactions is usually unavailable to researchers with merely a few excep- 
tions. Kyriakopoulos et al have investigated the statistical properties of the transaction network of all major financial 
players (423 accounts) within Austria over one year Js^]. They found that the directed network is disassortative and 
the random matrix analysis is able to identify an account with financial misconduct. Tseng et al conducted a series 
of web-based prediction market experiments for 97 days with 2095 effective participants and 16936 transactions to 
explore the structure of transaction networks and to study the dynamics of wealth accumulation 15711 . They found that 
1 5811 . the trading networks are scale-free and disassortative. These empirical properties can be well reproduced by a 
zero-intelligence agent-based market model, outperforming the zero-intelligence plus model lf59ll and the Gjerstad- 
Dickhaut model 16011. Recently, trading networks constructed from the September 2009 E-mini S&P 500 futures 
contracts were analyzed by Adamic et al ll6 111 . They found that there are contemporaneously correlated relationship 
between network metrics (centralization, assortative index, clustering coefficient, and large strongly connected com- 
ponent) and financial variables (returns, volume, duration, and market liquidity). By constructing trading network 
from the real trading records from Shanghai Futures Exchange, Wang and Zhou found that the futures trading net- 
works exhibit such features as scale-free, small-world effect, hierarchical organization, and power-law betweenness 
distribution 16211 . 

To our knowledge, studies about the trading networks of all traders involved in the transactions of individual stocks 
are still lack. In this work, we attempt to fill this gap by investigating the trading networks using the order flow data 
of a highly liquid stock named Shenzhen Development Bank listed on the Shenzhen Stock Exchange, an emerging 
market. In consistence with many empirical social complex networks, the stock trading networks exhibit power-law 
degree distributions. This indicates that there are very few investors with many trading connections while there are 
many investors with only a few links. We further find the disassortative architecture of the trading networks and the 
power-law correlation between order sizes and degrees, which motivates us to employ the fitness model to explain the 
apparent power-law degree distribution. 

The paper is organized as follows. In Sec. |2l we briefly describe the trading rules of Shenzhen Stock Exchange 
and the data sets we adopt. Section[3]defines "trade" and its size and studies the empirical distribution of trade sizes. 
We provide an overview of the trading networks in Section |4] and investigate their global properties in Section |5j 
The fitness model is adopted to uncover the observed power-law degree distribution in Section |6l At last. Section Q 
concludes. 



2. Trading rules and data sets 

Shenzhen Stock Exchange, which constitutes the stock markets in Chinese mainland with Shanghai Stock Ex- 
change, provides the market place and facilities for securities trading. Usually, the exchange is open for trading from 
Monday to Friday, except public holidays and other days as announced by the exchange. On each trading day, open 
call auction is held between 9:15 and 9:25, followed by cool period from 9:25 to 9:30. Before July 1, 2006, The 
continuous trading was from 9:30-11:30 and 13:00-15:00. Note that after July 1, 2006, the continuous trading is from 
9:30-11:30 and 13:00-14:57, followed by closing call auction between 14:57-15:00. 

Generally speaking, investors can submit market orders and limit orders and cancel unfilled orders. However, only 
limit orders are allowed for submission in Chinese stock markets in 2003. When the price of an incoming limit order 
is not less aggressive, transaction occurs. The incoming Umit order might be partially filled or fully filled. A fulled 
filled order or the filled part of a partially filled order can be regarded as an effective market order, while an unfilled 
order or the unfilled part of a partially filled order is treated as an effective limit order Without loss of clarity, we 
term them as market order and limit order for simplicity. During different trading periods, the trading system will 
take different treatments to the submitted orders. Our first step is to reconstruct the limit order book based on the 
order flow data for each trading days. Hence, it is necessary to elaborate to the order executing regulations for each 
trading period (opening call auction, cool period, and continuous double auction). It is worth pointing out that all the 
submitted orders are sorted according to the principle of price-time priority. 
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During the opening call auction, all the submitted orders are added into the limit order book. This means that 
the trading system regards all the submitted orders as limit orders. Cancelation order submitted before 9:20 will 
be executed instantly after its submission and cancelation orders submitted between 9:20 to 9:25 are frozen which 
take into effect only after the call auction. At 9:25, the call auction transactions occur based on the principle of 
maximizing the trading volume. The remaining orders in the order book matched with the cancelations, which are 
submitted between 9:20 and 9:25, are canceled by the trading system. The unexecuted orders automatically enter the 
cool period. During the cool period, the incoming market orders will be executed instantly by matching the orders 
on the opposite side, while the limit orders are added into the limit order book. However, the cancelations are not 
executed right after it was submitted. At 9:30 am, cancelations of remaining orders on the book will be centralized 
processed by the trading system. The unexecuted orders automatically enter the continuous double auction period. 
During the continuous double auction, the limit orders are added into the limit order book. All the market orders and 
order cancelations will be executed right after they are submitted. Before the closure of market, the unexecuted orders 
are automatically canceled by the trading system. 

Our analysis is based on the order flow data for a stock named Shenzhen Development Bank Co., LTD (A-share, 
stock code: 000001) during the whole year of 2003 (totally 237 trading days) 163, 64]. Each entry (or event) of the 
data contains the time of order placement or cancelation that is accurate to 0.01 second, direction of buy and sell, 
order size, limit price, and the encrypted identity of the trader. Table [T] reports a segment of taken from the order 
flow data recorded on 2003/01/02. The indicator is a buy-sell identifier, which identifies whether a record is a buy 
order, a sell order, or a cancelation. The database totally records 3,925,832 events submitted by 595,836 investors, 
including 1,718,156 buy orders, 1,595,961 sell orders, 598,750 cancelations and 12,965 invalid orders. Note that the 
institutional investors only account for a very small fraction (0.55%) of total investors 16511 . which is irrelevant to the 
order splitting strategies of institutions that result in order packages Using this database, we can rebuild 

the limit order book according to the trading rules and construct the trading networks for each trading day. The 
reconstructed transaction records are compared with the historical price and volume series to ensure the reliability 
of the obtained limit order book. It is evident that the size of our data set is much larger than the Austrian trading 
network lf56ll and the experimental market IstIi . 



Table 1 : A segment of the order flow data 



time 


ID 


indicator 


order price 


order size 


9342152 


a 


28 


10.20 


300 


9342174 


b 


28 


10.19 


300 


9342322 


c 


23 


10.20 


11000 


9342332 


d 


25 


10.22 


4000 


9342371 


e 


26 


10.21 


300 


9342382 


f 


28 


10.05 


1000 



We define transaction ratio r to characterize how many orders out of the total submitted orders are eventually 
executed. The transaction ratio r (respectively, r-^^k and rbid) is nothing but the ratio between the number of executed 
orders (ask orders and bid orders) and the number of total placed orders (ask orders and bid orders). Fig. [T] illustrates 
the evolution of ratios (r in the low panels, rask in the middle panels, and rud in the up panels) with respect to trading 
days. We find that there is no remarkably trend in the three curves and most of the values are centered around their 
means. On average, 67.3% (respectively, 68.8%, 74.3%) of the total orders (ask orders, bid orders) are matched. 

3. Definition of trade size and its probability distribution 

According to the trading rules, one transaction occurs when a bid order and an ask order are matched. We define 
the corresponding size of order matching as trade size v. The trade size here is the minimum quantity of trading 
volume at the microstructure level of stock market and has the physical meaning of transferring a certain amount of 
shares from a seller to a buyer. In order to have a better understanding of the definition of trade size, we present a 
schematic diagram of order matching in Fig.|2l The diagram describes the matching situation that a market bid order 
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Figure 1: (Color online) Plots of the transaction ratios r. Cask, and j-bij with respect to the trading days. 



matches with the Hmit orders at the best ask price, which gives the best ask price as a transaction price. The right bar is 
a queue of effective sell limit orders submitted by five investors h, i, j, I and m at successive times f/, < f,- < tj <ti< tm 
with sizes Sask,;i = 200, ^ask,/ = 100, iaskj = 300, iask,/ - 100, and Sask,™ = 200. When an effective buy market order 
(represented by the left bar) with size ^bid.x = 500 is submitted by investor x, it will match three different ask orders 
placed by investor h, i, and j, which leads to the bilateral transactions between investor li and x with trade size as 
v/u = 200, between ; and x with v,^ - 100, and between j and x with vj^ = 200. We treat these as three trades, which 
is different from the conventional meaning of trade with size o) defined according to the aggressive side and 
we have w.v = v^x + vix + vj^ in the present case. 



Sbid, x = 5Q0\ 



v^x = 200 



v,x = 100 



Vhx = 200 



I 



Sask, rn = 200 

•S'ask, I = 100 

Sask, J = 300 

Sask, 1 = 100 

Sask, h = 200 



Figure 2: (Color online) Diagram illustrating the definition of trade size v. Five investors h, i, j, I and m submitted effective hmit orders with sizes 
■*ask,A = 200, iask,/ = 100, .Vask.j = 300, i^sk,/ = 100, and .Vask.m = 200 at successive times f/, < f, < tj < ti < t,„, stored as a queue at the best ask 
price in the ask order book. Investor x then submitted an effective buy market order of size ibid.A- = 500 at time tj^ > t,„ . The matching of the bid 
order and three different ask orders leads to three different transactions. 



The statistical properties of trade size oj and trading volume have attracted considerable interests in the Econo- 
physics community, attributing to the fact that the understanding of their behaviors is particularly relevant to the price 
formation dynamics. The distributions of trade size co for stock markets are found to exhibit power-law tail distribu- 
tions. However, there has been a heated controversy about whether the estimated power-law tail exponents locate in 
the Levy regime (y,, < 2). By adopting different estimating method olog ies, some groups find that the tail exponents 
are in the Levy regime ItS'IIIj while some others disagree ItI, 72, 78-81 |. Here we contribute to this literature by 
investigating the distribution of trade size v defined in Fig.|2] 
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Fig. Ha) illustrates the cumulative probability distribution of trade sizes v for three typical trading days (January- 
03-2003, April-10-2003, and October-20-2003). One can see that there is an asymptotical power-law decay when v is 
larger than Vmin, such that 



C(v) 



if V > V 



— ^'min- 



(1) 



In order to confirm our observations of power-law trade size distributions, a strategy is adopted as a standard procedure 
to check whether a given sample is drawn from a specific distribution, which is to fit the empirical data with maximum 
likelihood estimation and test the goodness of fits with the Kolmogorov-Smirnov (KS) statistic. Recently, Clauset, 
Shalizi and Newman (CSN) have proposed a power-law fitting approach for estimating the power-law exponent of 
empirical data as well as the low bound of the power-law behavior p82Il. The approach combines maximum-likelihood 
fitting methods with goodness-of-fit tests based on the Kolmogorov-Smirnov statistic and likelihood ratios. 
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Figure 3: (Color online) Distribution of trade sizes v. (a) Cumulative probability distribution of trade sizes for three typical dates (January-03-2003, 
April-10-2003, and October-20-2003). The solid lines are the best fits to the power-law tails, (b) Scatter plot of the pairs (I'min, Tv) for different 
trading days. Each point corresponds to the low boundary of power law v,^ii, and the power law exponent y,.. The marker X means the trading day 
on which the scaUng range for power-law tail is not less than one order of magnitude and the marker + represents the trading day on which the 
scaling range is less than one order of magnitude, (c) Empirical probability distiibution of trade sizes v over entire investigated period. The solid 
line is the best fit to a power-law distribution. 



We proceed the fitting procedure on the trade size samples by applying the above-mentioned powerful power-law 
calibrating technique for all the trading days. Calibrating results of three typical days are chosen as examples to 
illustrate the power-law tail distributions. As illustrated in Fig. [3 a), the open markers are the empirical distributions 
and the solid lines are the best fits to the power-law distributions, which gives jy - 1 .76 (Vmin = 2749) for January- 
03-2003, r,, = 1.27 (Vmin = 2500) for April-10-2003, and y,, = 1.31 (v^n = 2300) for October-20-2003, respectively. 
The obtained calibrating parameters I'min and y,, are then substituted into the KS test procedure to test the goodness of 
fitting. The null hypothesis Hq for our KS test is that the data (v > v,nin) are drawn from a power-law distribution. We 
find that the null hypothesis Hq cannot be rejected at the significant level of 0.01 for all the days. Fig.[3](c) depicts 
the bivariate distribution of pairs (Vmin, y, ). Each node is associated with the low bound of power-law tail v'min and 
the estimated power-law exponent y,,. We utilize the notation sr to represent the scaling range of the power-law tail 
and distinguish sr > I and ir < 1 by marker x and +. sr > 1 means the scaling range is not less than one order of 
magnitude, and sr < I represents the opposite. The percentage of trading days on which the scaling range of the trade 
sizes spans not less than one order of magnitude is around 73.42% in the samples. Concerning the trading days with 
the scaling ranges no less than one order of magnitude, there are 97.70% days whose power-law exponents are well 
within the Levy-stable regime. In addition, we also study the probability distribution of the trade sizes over the entire 
time period investigated, as illustrated in Fig. [3] (c). A nice power-law tail distribution is observed and maximum 
likelihood estimate also gives the tail exponent y = 1 .7, which is well inside Levy stable regime. 



4. Stock trading networks 

4.1. Construction 

In order to construct the stock trading networks, we have to reconstruct the limit order book based on the trading 
rules and to extract the detailed information of each transaction. A transaction is triggered by an incoming market 
order matched with the limit orders waiting on the opposite order book and accomplished by transferring shares from 
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seller to buyer and cashes from buyer to seller, as sketchily illustrated in Fig. [2] It provides an opportunity to trace 
the order execution procedure from a complex network perspective. Hence, we can mathematically write the trade 
sizes, defined in Sec. [3l between pairwise traders into aN xN matrix V, on trading day t. Its entry Vjp corresponds to 
the trade size, which represents the number of shares flowing from investor / to investor j. We only take into account 
directed networks here, therefore we have Vip + vji^,. The adjacent matrix is defined as A, - [a,;.,], where aip = I if 
Vip > and Uip = if x'tp = 0. A schematic diagram is illustrated in Fig.|4] which is a part of the trading network 
constructed from the limit order book data on October 9, 2003. 

4.2. Evolution of network size 

The network size varies with respect to the trading days f, for no two arbitrary days share the same trading 
activities. We utilize four quantities to describe the network size: the number of traders, the number A^ask of traders 
who submitted ask orders, the number A^bid of traders who submitted bid orders, the number Ng of trades (or edges in 
the trading network). We have an inequality ^ A^ask + A^bid, since it is possible that some traders submit both buy 
and sell orders on the same day. We plot the number of traders A^, ask-traders A^ask, bid-traders A^bid, and trading edges 
Ne with respect to the trading day t in the up panel of Fig. |5] From the observation that the A^ and A^,. curves almost 
are overlapping together, one can infer that A^^, is a little larger than N, which indicates that the constructed network 
resembles a tree-like network. This leads to the fact that the clustering coeflicient of trading networks is very low. We 
also present the prices of our investigated stock in the corresponding time interval and find that A^, A^ask, M)id, and A^^ 
are all synchronous with the stock prices. The corresponding correlation coefficients are 0.44 for price and A^, 0.36 
for price and A^ask, 0.45 for price and A^bid, and 0.44 for price and Ne, respectively. 

4.3. Largest component 

Many real world networks are fragmented into isolated subnetworks or "components". A component is a part of 
a network, in which two arbitrary nodes are connected by at least one path, while there are no edges connecting two 
different components. Among these components, the largest component which encompasses the largest number of 
nodes is considered as the most important component in the network. The evolution of the relative size tlc of the 
largest component that is defined as the ratio of the number of nodes in the largest component to the total number 
of nodes in a network is illustrated in the up panel of Fig. |6] We also show as the relative size r2LC of the second 
largest component for comparison. One finds that the minimum tlc is greater than 95% in our investigated samples. 
In contrast, the second largest component only contains a very small fraction of the nodes in the system. Our analysis 
indicates that most of the traders are connected by a giant component. It is even interesting to investigate the corre- 
lation relationship between tlc (also r2Lc) and financial variables, including volatility and total trading volume. We 
employ the price range defined as the difference of maximum price and minimum price (in logs) as the measure of 
volatility for each trading day. The correlation coefficients are 0.42 for tlc and volatility, -0.30 for r2LC and volatility, 
0.42 for tlc and volume, -0.29 for r2LC and volume, respectively. 




Figure 4: Part of the directed stock trading network constructed from the limit order book data on October 9, 2003. The nodes represent investors 
in the market and the directed edges stand for transactions. 
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Figure 5: (Color online) Plots of the number of traders A', ask-traders Na, bid-traders Nt, and trades in the up panel. For comparison, the stock 
prices are also plotted in the low panel. 

l.OOi ■ ■ ■ 1 




Figure 6: (Color online) Plots of the relative sizes of the largest component tlc (up panel) and the second largest component r2LC (low panel). 



5. Network degrees 

Degree is regarded as one of most important quantities in complex networks, which describes how many neighbors 
each node has. Here, in our trading networks, degree can be interpreted as the number of trading counterparties of 
each investor. Because our constructed trading network is a directed network, we define three types of degree to 
illustrate the trade relationship in stock market, including the ask-degree (out degree) k^^^ = Y^iAjj, the bid-degree (in 
degree) kbi^ = Tij^ij^ ^nd the total degree k = fcask + ^bid for each trader We note that 2 k^^]^ - XI ^bid for all traders 
due to the buy-sell parity. 

5.1. Evolution of average degrees 

Fig.|2](a) illustrates the evolution of the daily average values of (fcask), (^bid), and (k). Comparing with the stock 
prices, one can see that (k^sk) > (kbid) in the bearish range and (^bid) > (^ask) in the bullish ranges. We also observe that 
there are many sharp spikes for (A;bid) and especially for (k-^sk)- In order to give an interpretation of this phenomenon, 
we plot the daily average degrees and the daily average order sizes (s) on the same coordinates. As illustrated in Fig.|7] 
(b), pairs of [{k^sk) (left y-axes), (sask) (right y-axes)] are sketched in the upper panel and pairs of [(A:bid) (left y-axes), 
(■Sbid) (right y-axes)] are presented in the lower panel. These sharp spikes are a consequence of the appearance of 
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relatively large values of average order sizes. However, the average values of (k) do not have as much fluctuations 
as (k^sk) and (A:bid), which has a less fluctuated value close to 2.29 + 0.12 (mean + std). This is due to the facts that 
the average degrees can also be calculated from (k) - IN^/N and that the two curves of A^^, and in Fig. |5] almost 
superpose. 
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Figure 7: (Color online) Analysis of average degrees, (a) Evolution of the average bid-degree (k^sk)^ ask-degree <<:bid)> and total degree (k) with 
respect to the trading days. For illustration of bearish and bullish period, separated by two vertical lines, the stock price is presented in the low 
panel, (b) Pairs of [(i:ask) (Isft y-axes), (s^sk) (right y-axes)] and [(i'bid) (Isft y-axes), {ibid) (right y-axes)] evolving with the trading days. 



5.2. Probability distribution 

After understanding the evolution behavior of the average degrees, we further investigate the probability distribu- 
tion of the three kinds of degrees for each trading day. Fig.[8]illustrates the cumulative distribution of ^ask, ^bid, and 
k for three randomly chosen days (Mar-05-2003, May- 13-2003, and Dec-17-2003). We can find that the cumulative 
probability distribution function C{k) asymptotically decays as a power law with respect to k, 

cfck)~Ck"\ c(^ask) ~ ^bTd", c{k)~k-y. (2) 

We adopt the same strategy utilized in Sec.|3]to confirm our observations of power-law degree distributions. First, we 
calibrate the empirical data by the CSN approach [[82|. Second, we perform KS test forjudging the goodness-of-fit 
of the fitting distribution compared with the empirical distribution. The filled markers are the empirical distributions 
and the solid lines are the best fits to the power-law function form, which gives yask - 1-32 (A:ask, min = 3), ybid = 1-93 
(^bid, min = 3), and y = 1.56 (^min = 3) for Mar-05-2003, yask =1.21 (^ask min = 3), y^d = 1-05 (^bid, min = 5), and 
y = 1 .49 (^,„„ = 3) for May- 1 3-2003, and yask = 1.51 (^ask, min = 4), ytid = 1 .96 (^b.d, min = 3), and y = 1 .7 (fe„i„ = 4) 
for Dec-17-2003, respectively. 

The KS test is implemented on the calibrating results to check the goodness-of-fit. Our null hypothesis Ho is 
the degrees can be well modeled by a power-law distribution. At a significant level of 0.01, we find that the null 
hypothesis Ho cannot be rejected for a fraction 100% (99.6%, 98.7%, respectively) of the trading days (237 days in 
total) for ask-degree k^^k (bid-degree A:bid, total degree k). Fig. |9] depicts the bivariate distribution of pairs (^min.y) 
for the samples which pass the KS test. Each node is associated with the low bound of power-law tail ky^^ and the 
estimated power-law exponent y. The inset shows the frequency of power-law exponents. The average values of the 
power-law exponents for the whole samples are yask = 1.42+0.22 (mean + std), ybid = 1.69+0.26, and y = 1.52+0.12, 
which are well within the Levy stable regime. We notice that the degree distribution of the directed Austrian trading 
network also falls in the Levy stable regime llsill . 

5.3. Disassortativity 

The average degree of nearest neighbor k„n{k) for vertices with degree k is usually adopted to quantitatively 
characterize the architecture of the investigated networks. If kn„{k) has an ascending (respectively, descending) form 
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Figure 8: (Color online) Degree distribution distribution for three typical days (Mar-05-2003, May-13-2003, and Dec-17-2003). The filled mai'kers 
are the empirical distribution and the soHd Hnes are the best power-law fits to data according to the CSN approach. The data for Mar-05-2003 and 
Dec-17-2003 are translated by a factor of 100 and 0.01 for better visibility. 



with respect to k, one can argue that the network is assortative (disassortative). The average degree of nearest neighbor 
k„„ i for node / is defined by 

^nnj — 'j (3) 

k 

where k is the degree of node / and kj stands for the degree of the j-th neighbor of node /. Then the values of k„„ can 
be obtained through averaging over A;„„ , of all the nodes with the same degree k. 

In order to gain better statistics, we regard all the nodes from different networks as an ensemble set, which is parti- 
tioned into 18 non-overlapping groups. In the partitioning procedure, we equally divide the interval [ln[min(A:)], ln[max(A;)]] 
into 18 groups and assign each node into the corresponding group if the degree of the node locate in the interval of 
the group. In each group, the average value (k) of k and the associated average value {k„„} of k„„j are determined. In 
our case, the neighbors of asking (bidding) nodes are bidding (asking) nodes. The relationship between {k„„} and (k) 
is plotted in Fig.[TO]for asking nodes and bidding nodes. Both curves share the same decreasing form, which implies 
the disassortative configuration in the trading networks. 

5.4. Correlation between order sizes and degrees 

The trading host is a queuing system. The limit orders are sorted according to the principle of price-time priority 
in the limit order book. A transaction occurs when an incoming order matches with the limit order waiting on the 
opposed side. Normally, the market orders with large sizes will eat up a relatively large number of limit orders at the 
opposite side and the limit orders with lager size could consume a relatively large number of market orders as well. 
This may lead to the correlation between the network degrees and the order sizes. In order to verify this conjecture, 
we extract the information about the total order size s and degree k for each stock trader on each trading day. To gain 
better statistics, all the records of different trading days are regarded as an ensemble set. We logarithmically equally 
partition the interval [min(s), max(i)] into 24 groups and assign each trader into the corresponding group if the order 
size locate in the interval of the group. For each group, the mean and standard deviation of order sizes and degrees 
are determined. However, we only get 16 data points for other 8 groups are empty sets. Fig.[TT]illustrates the errorbar 
plots of average degrees with respect to average order sizes for 16 groups in double logarithmic coordinates. The 
order size s exhibits an asymptotical power-law behavior with respect to k when s > 3000, 

k~ sP. (4) 

The least-squares fits to the data provide an estimation of the power-law exponents, such that ySbid = 0.92+0.02 for buy 
orders and ySask = 0.84 + 0.02 for sell orders. This analysis demonstrates the positive correlation between order sizes 
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Figure 9: (Color online) Bivariate distribution of pairs (/cmin.T) for different trading days. Each point corresponds to the low boundary of power 
law and the power law exponent y. The inset shows the empirical frequency of y. 
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Figure 10: (Color onhne) Plots of </:bid. mi) (respectively, (k-^sk, mi)) with respect to /c^sk (^bid)- The decreasing curves indicate that the stock trading 
networks have disassortative architectures. 



and degrees, which inspires us to regard the order size as a fitness variable in fitness model and to give an explanation 
of the power-law degree distributions in the trading networks 1,26,, 83>, 84)1 . 



6. Fitness model 

Variety of empirical complex networks extracted from different fields exhibit power-law degree distributions, and 
two fundamental mechanisms are proposed for the understanding of the ubiquitously emerging scaling-free behav- 
iors. One is "rich-get-richer", and the other is "good-get-richer". The concept of "rich-get-richer", also known as 
preferential attachment, is well captured by the Barabasi- Albert model, in which networks proliferate at a constant 
rate and new nodes are added to existed nodes with a probability p{k). The connected probability p{k) of each node is 
linearly increasing to its degree, which indicates the nodes with high degrees will have high priority to acquire links 



from newly arriving nodes 111 111 . The model of "good-get-richer" is introduced to interpret some empirical networks 



with scale-free behavior stemming from the vertex intrinsic fitness 118311 . Garlaschelli and LofFredo have utilized this 
model by assigning the gross domestic product (GDP) as the vertex fitness to each node, which is able to reproduce 
the topological properties of world trade webs 12611 . 
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Figure 1 1 : (Color online) Plots of the coirelation between order sizes .v and degrees k for buy and sell orders. The data for sell orders are shifted by 
a factor of 10 for better visibility. The solid lines are the best power-law fits to the data. 

From the observation of the power-law correlation between order size s and degree k, we conjecture that the 
appearance of power-law degree distributions results from the order sizes. To verify our conjecture, the fitness model 
is adopted to simulate the stock trading networks. We define a set of nodes with fitnesses, in which the nodes are the 
investors and the fitnesses are their order sizes s from the original order flow data. The network-building algorithm is 
proposed as follows. Two nodes ; and j are randomly chosen from the set and a transaction occurs between the two 
nodes whose trade size is determined by the smaller order, v - min(i,, sj). Then we subtract the trade size from the 
two original order sizes, s; = s; - v and sj = sj - v. If the remaining order size equals to zero, the corresponding node 
is removed from the set. After all investors are removed, the process ceases. 

For each trading day, we obtain 1000 synthetic networks generated from the network-building algorithm. We 
adopt the same strategy utilized in Section[3]to check whether the degrees of each synthetic network are drawn from a 
power-law distribution. We define Pmodei as the percentage of the synthetic networks whose degree distributions pass 
the KS test at a significant level of 0.01 for a given trading day. As illustrated in Fig. [12] (a), the percentage pmodei 
is plotted with respect to the trading days t for ask-degree fcask, bid-degree fcbid, and degree k, respectively. We find a 
fraction 99.6% (respectively, 97.9%, 95.4%) of the trading days (237 days in total) on which the percentage Pmodei is 
larger than 99% for ask-degree k^sk (bid-degree ^bid, total degree k). Fig.[T2](b) presents the scatter plots of yreai with 
respect to ymodei- One can observe that most of the points locate above the solid line ymodei - Treab indicating that 
the power-law exponents obtained from fitness model are lager than that of real market. The deviations between yreai 
and ymodei reflect the fact that the considerations of the sequence and the ask or bid prices are also important in the 
matching mechanism. 

7. Concluding remarks 

In this paper, we have performed an investigation of the statistical properties of stock trading networks based 
on the order flow data of a high-liquidity stock listed on Shenzhen Stock Exchange (Shenzhen Development Bank) 
during the whole year of 2003. 

For each trading day, we have reconstructed the limit order book and extracted the detailed information of each 
executed order (including the order matching, trade price, and the corresponding trade size). The trade sizes are found 
to exhibit power-law distributions and most of the estimated power-law exponents are well inside the Levy regime. 

Based on the detailed trading information, one can establish stock trading networks, in which the investors rep- 
resent nodes and each transaction is translated as an directed edge drawn from a seller to a buyer with the trade size 
as its weight. Each network contains a largest component, which contains more than 95% of investors. Furthermore, 
the trading networks surprisingly have power-law degree distributions and disassortative architectures. The fitness 
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model is employed to interpret the consequence of power-law distributions, which are motivated by the power-law 
correlation between order sizes and degrees. 

Trading networks have provided an opportunity to trace the order execution process in financial markets from the 
perspective of complex networks. Future works for the application of trading networks are necessary to be carried 
out. For instance, one can try to investigate the relations between network variables and traditional financial variables, 
between local network structures and abnormal trading behaviors, between network eigenvalues and abnormal traders, 
and so on. We believe that trading network analysis is a potentially useful tool for analyzing the microstructure of 
financial markets l85-90ll. 
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